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Abstract. We investigate the possibility of virtual community formation in an online social network under 
a rapid increase of activity of members and newcomers. The evolution is studied of the activity of online 
users at The Guardian - Comment Is Free forum - covering topics related to the Middle East turmoil 
during the period of 1^* of January 2010 to the 28*^ of March 2011. Despite a threefold upsurge of forum 
users and the formation of a giant component, the main network characteristics, i.e. degree and weight 
distribution and clustering coefficient, remained almost unchanged. 
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1 Introduction 



of connections, transitive closure of triples, relative high 
node degrees and connection into a large component. 



The properties of social networks that are generated through 
the internet discussions of almost all forms, such as e-mails 
[1], blogs [2], discussion forums [3,4 , bulletin board sys- 
tem [5l[6] and more recently online social networks [7,8 
have been studied over last decade. 

In recent years much interest has been expressed, par- 
ticularly in the media, about the role which on line social 
networks may play in the mobilisation of social protest. 
The recent Middle East turmoil is no exception. However, 
there has been little direct documentation of the implied 
upsurge of network traffic occasioned by turbulent events. 
This includes both those directly involved in such events 
and those who merely care to reflect upon the events. 

One of the most intriguing questions is the extent to 
which virtual communications might provide, at least in 
part, the foundations for a virtual community which, in 
turn, may eventually mobilise in the real world. It is be- 
yond the ambitions of this paper to answer this very broad 
question but clearly it is pertinent to initially ask whether 
one can flnd evidence for the formation of virtual commu- 
nities leaving it open as to whether the next step might 
or might not be taken. This brief paper reports upon a 
study of the Guardian (a British daily newspaper) inter- 
net forum- Comment is Free- in order to ascertain the 
impact of the events precipitated by the social unrest and 
change of Government in Tunisia. 

Standard network (graph and di-graph) theories [9 
would imply that tendencies towards community forma- 
tion would be detected by the evolution of a high density 



Send offprint requests to: 



2 Data 

A web crawler software was designed to download the com- 
plete content of both journalists' articles and public dis- 
cussions occurring at the Guardian int ernet foru m called 

Comment Is Free (http://www.guardian.co.uk/commentisfree7alll 
during the period from the 1*^ of January 2010 to the 28^^^ 
of March 2011. 

Each day during that period articles appearing in The 
Guardian newspaper were published on-line and made avail- 
able for public comment at the Comment is Free web-page. 
Every article was authored, dated and carried a keyword 
tagging its subject matter, e.g. UK Politics or Barack 
Obama. 

All those wishing to comment on any article are re- 
quired to register; thus, all commentators are identifled. 
Each comment carries its author's nickname and date of 
posting. In addition each comment can attract a number 
of recommendations by other commentators which pro- 
vides an index of the popularity of any particular com- 
ment. Comments always appear, in chronological order, 
directly under the main article and can be registered for 
a period of up to four days after the article is posted, but 
not thereafter. 

This paper focuses upon all the articles, which were 
tagged by one of six keywords: Tunisia, Ben Ali, Egypt, 
Mubarak, Libya and Gaddafi. During the period studied, 
279 journafists' articles were found that matched these 
tags, which then generated 35,942 comments posted by 
6,217 identiflable commentators. 



2 



B. Kujawski and P. Abell: The Middle East Revolutions at the Guardian Forum: Comment Is Free 



1600 

1400 - b) 
1200 - 
w 1000 - 

800 - 

1 600 - 
O 400 : 

200 - 

:w.Jwv 



1/12/2010 30/01/2011 




01/01/2010 01/04/2010 30/06/2010 




01/04/2010 30/06/2010 



Fig. 1. The time series over the period of study of (a) the 
number of tagged published articles and (b) the number of 
comments. Inset in (b) is a zoomed series depicting greater 
detail. The vertical lines in (b) indicate, in chronological order, 
the major political events which are shown in Tab. [T] 



Table 1. The major political events during the study period. 



Date 



Event 



20/12/2010 Tunisia - Uprising of protest 

14/01/2011 Tunisia - Ben Ah fled the country 

25/01/2011 Egypt - Beginning of revolution 

11/02/2011 Egypt - Mubarak resigned from office 

15/02/2011 Libya - Gaddafi confronted peaceful protest 

27/02/2011 Egypt - PM Ghannouchi resigned 

17/03/2011 Libya - UN approved no-fiy zone 



3 Linking Commentators 

Internet discussion might appear to be an ideal candi- 
date for a 2-mode network analysis [10 , 11 whereby all the 
commentators who post comments on the same article are 
pair- wise connected. Such a network does not, though, in 
any way reflect interpersonal interactions, but only a com- 
mon interest which might be interpreted as a necessary, 
though far from sufficient, condition for the existence of an 
embryonic virtual community. Such networks have, never- 
theless, been used to infer interpersonal relationships [12]. 
However, the average number of distinct commentators 
per article in our dataset is 69 and the 2-mode network 
structures generated would have average degrees and net- 
work density parameters, way beyond those directly ob- 
served in interpersonal social networks. They are accord- 
ingly probably rather unreliable indicators of any on line 
social bonding. 

In order to construct a directed (1-mode) social net- 
work structure, advantage can be taken of the fact that 
commentators typically attach to their comments the nick- 
name associated with the comment towards which they 
wish to direct their remarks. The construct ©nickname, 
has became an official standard among internet discussion 
users across many different internet discussion platforms. 

Thus, a weighted and directed link can be established 
between A and B when commentator A posts a comment 
that contains the nick name of commentator B which is 
attached to a prior comment concerning the same article. 
A weighting of the link is then computed as the num- 
ber of times A comments on B.All links are directed and 
stamped by the time of the first direct communication by 
A directed to B. All commentators are also time-stamped, 
appearing in the evolving network for the first time when 
they comment upon another commentator. 



Fig. [T^ depicts the time series of the number of appro- 
priately tagged articles appearing in the forum and reveals 
a sharp upturn shortly after the beginning of 2011. 

Fig. [Hd, which depicts the frequency of posted com- 
ments over the specified study period, is accompanied by 
the date of certain major events (Tab. [T]), post January 
of 2011. The insert accompanying Fig.[T]3 zooms in to the 
active period, giving a clearer picture of the incidence of 
comments. These time series refiect the upsurge in arti- 
cles shown at Fig. [1^ with a clear and precipitate increase 
in frequency of comments after about 1/1/2011, the occa- 
sion of the Tunisian uprising. The fiuctuating increase in 
frequency seems also to be stimulated by each of the iden- 
tified subsequent major events. The correlation of the time 
series in Figs.[T^ and[T]3, post 1/1/2011 is 0.88 (though, in- 
terestingly, it is 0.93 for the full period). This refiects the 
fact that before the upturn only a few immediate com- 
ments are provoked whereas afterwards the more frequent 
comments are spread over four days. Articles then, on av- 
erage, cast a more sustained commentary shadow. 



4 Results 

The evolution of some major parameters of the evolving 
directed network, which my be interpreted as indicative of 
an emerging virtual community, are depicted in Figs.[2]a-d. 
An animation of the networks evolution can also be found 
at: http://www.youtube.com/watch?v=L7t-Fj-EAZs, 

Fig. [2^ exhibits a sharp upturn in the number of com- 
mentators (i.e. the size of the node set) registering com- 
ments at the date already identified. Similarly, Figs. ^ 
and [2t show upturns in respectively the number of di- 
rected edges and the average degree of nodes (computed 
for the sake of simplicity as. the average number of links 
incident in and from a node) in the evolving structure. 
These are probably the results one would anticipate. 

Fig. [2]i depicts the movement of node clustering (i.e. 
C, the proportion of 3-cycles irrespective of the direction 
of links which are closed/transitive). Clustering is usu- 
ally taken to be indicative of the formation of highly con- 
nected cliques or groups of nodes and, thus, community 
formation. Although there is a mild upturn in clustering at 
around 1/1/2011 there was also one a year before. There 
appears, therefore, to be only a slight tendency towards 
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Fig. 2. Time series of (a) the number of nodes, (b) the number 
of directed edges, (c) the average degree < k > (d) and the 
clusterine: coefficient. 
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Fig. 3. Evolution of a Giant Component (GC), (a) its size, 
(b) overall number of components, (c) GC size / N ratio, (d) 
average shortest path in GC and (e) the diameter of GC. 



virtual clique formation at the start of 2011. The cluster- 
ing coefficient hovers around 0.05 throughout the observa- 
tion period and both maintains a low absolute value and 
fails to dramatically increase. Thus, despite the dramatic 
change in both the number of nodes (commentators) and 
edges after the start of 2011, there little support in these 
results for the view that a virtual grouping is in formation. 
This conclusion should, however, be treated with consid- 
erable caution. There may be no need for a connected 
group of individuals to exhibit online clustering in order 
to provide a basis for a latent virtual community. It seems 
sensible to turn in the direction of connectivity. 

In order to study the emergence of components all links 
were treated as undirected. That is to say any two nodes 
are deemed to be connected if there is a link in either or 
both directions. 

Fig. [3^1 depicts the evolution of the size (i.e. number 
of nodes) of the giant (i.e. the largest) component; Fig. 
[Sb depicts the total number of components and Fig. [Sh 
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Fig. 4. Degree distribution in the revolution discussion net- 
work for the final 5 months. The distribution was fitted by 
power law distribution (red dashed curve) f{x) = ax~^ , where 
7= 1.78 ±0.02. 



the fraction of the nodes contained in the giant compo- 
nent. Figures [3]i and[3fe show the time series of the aver- 
age shortest path length and the network diameter, (both 
across components) respectively. The vertical line in all 
the diagrams indicates the date of maximum number of 
components (324) which was reached at 02/02/2011. Af- 
ter this date we observe a percolation connecting isolated 
components and at the termination of observation period 
96% of nodes belong to the giant component. These results 
strongly support the conclusion that quickened commen- 
tary leads to the emergence of a large component embrac- 
ing most of the commentators. The average shortest path 
length declines very slightly but the emergence of a domi- 
nating large component appears to have little impact upon 
this parameter; likewise the diameter (i.e. the longest path 
in a component). 

Fig. |4] depicts the degree distributions across the exist- 
ing nodes for different dates. The distributions appear re- 
markably consistent across time showing, at all the recorded 
dates, more variability at the higher levels of degree. Thus, 
although the number of directed links increases rapidly 
(Fig. [2]3) after the start of 2011 the average degree does 
not correspondingly increase because of the increasing num- 
ber of nodes (Fig. [2^) (i.e. commentators). 

This conclusion is not altered by taking account of the 
weightings on the directed links (Fig. [5]). One might expect 
that, if a virtual community were to have been in forma- 
tion, then the intensity of weightings would increase with 
the advent of the focal events. Again there is no support 
for the supposition. 

The distribution of weightings at the end of the obser- 
vation period is as shown at Tab. [2j 

These figures provide little support for the view that a 
virtual community is under formation. Very few commen- 
tators communicate with one another on more than two 
occasions. Moreover, the majority of edges (direct com- 
munication between two commentators) occurred only in 
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Fig. 5. The distribution of the directed edge weights for last 
5 months of the observation period. The maximum hnk weight 
found is 18. The distribution was fitted by power law distribu- 
tion (red dashed curve) f(x) = ax~^ , where f3 = 3.20 =b 0.03. 

Table 2. The final percentage of edges with given edge weight. 



Weight 


% of edges 


1 


83.0 


2 


11.0 


3 


3.0 


4 


1.3 


5 


0.6 


> 5 


1.1 



respect of a single article; only 378 out of 10,021 edges 
(3.77%) were found in more than one article. This again 
indicates little durability in the few relationships which 
are formed. 



5 Conclusions 

An analysis of the communication between commentators 
on a web site associated with a major national newspaper 
show an expected quickening of interest in articles relat- 
ing to specified major international events. The question, 
thus, arises as to whether those involved might show signs 
of forming a virtual community. An inspection of clus- 
tering, degree distributions and the distribution of the 
weighting interpersonal communications offer little sup- 
port for this proposition though a large component em- 
bracing most of the nodes is generated. Tab. [3] gives a 
summary of the various properties of the evolved network 
at the termination of the study period. 

The lack of community formation may, however, be at- 
tributed to the nature of the discussion forum driven, as 
it is, by published journalists articles. On one hand com- 
mentators can easily comment upon an article, or direct 
their comment to another commentator, but on the other 
hand the web-page does not facilitate any other form of 



Table 3. Summary of the social network properties at the end 
of the observation period (28/03/2011). 



Property 


Value 


Nodes N 


3239 


Edges E 


10011 


Average degree < k > 


5.18 


Clustering C 


0.056 


Giant Component size 


3109 


Components 


64 


GC size / Nodes 


0.96 


Modularity 


0.334 


Communities 


755 


Degree dist. Exponent 7 


1.78 ± 0.02 


Weight dist. Exponent /S 


3.20 ± 0.03 



direct communication. A commentator has no other op- 
tion than to browse through all articles and all associated 
comments to check if one of his/her friends has posted 
a comment. Moreover, they cannot communicate directly 
and thus, depend entirely upon the frequency of postings 
by their favourite journalists. Closing the possibility of 
posting after 4 days can also reduce likelihood of further- 
ing discussion and thus the generation of a virtual commu- 
nity. This lead to the conclusion that not every web page 
that provides users discussion place is capable of forming 
a virtual community. We identify two key elements that 
are missing at Comment Is Free forum and which seems 
to be vital from the perspective of an online forum user; 
a freedom of originating of a subject of a discussion and 
an effective way of managing online acquaintances. 

Whether the results documented here provide grounds 
upon which inferences about real communities can be made 
remains an open question. It will prove interesting to com- 
pare these results with a similar analysis appertaining to 
other notable events. 
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